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Three-dimensional topological insulators are classified into "strong" (STI) and "weak" (WTI) 
according to the nature of their surface states. While the surface states of the STI are topologically 
protected from localization, this does not hold for the WTL In this work we show that the surface 
states of the WTI are actually protected from any random perturbation that does not break time- 
reversal symmetry, and does not close the bulk energy gap. Consequently, the conductivity of 
metallic surfaces in the clean system remains finite even in the presence of strong disorder of this 
type. In the weak disorder limit the surfaces are found to be perfect metals, and strong surface 
disorder only acts to push the metallic surfaces inwards. We find that the WTI differs from the STI 
primarily in its anisotropy, and that the anisotropy is not a sign of its weakness but rather of its 
richness. 
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I. INTRODUCTION 

Topological insulators (TIs) have recently become a 
very active subject in condensed matter physics. The 
classification of states of matter according to topological 
indices opens new horizons both theoretically and exper- 
imentally, and may hopefully lead to applications^ ^ . 

For more than two decades, topological classification of 
phases was manifested primarily in the realm of the quan- 
tum Hall effect. The experimental observations of TI in 
two dimensions'^ (2D) and three dimension^ (3D) ex- 
panded this notion also to systems that are time-reversal 
(TR) symmetric, and have sparked a "race for golf" for 
new topological phases, and for their unique properties. 

In 2D, TR symmetric band insulators are classified into 
"trivial" and "topological" by a Z2 indeisP. At the one- 
dimensional (ID) interfaces between a topological insu- 
lator and the vacuum (or any other trivial insulator), 
the energy gap must close, implying the appearance of 
counter-propagating chiral gapless modes. As long as 
TR symmetry is preserved, these modes are protected 
from back-scattering and gapping. In contrast, when a 
bi-layer system is formed of two such TIs, coupling of 
the edge modes in the two layers may gap them without 
violating the TR symmetry. 

In 3D, TI's are classified by four Z2-indices {vq, lyjMH 
A non-trivial vq implies that on each 2D surface of the 
sample, the bulk gap is closed by surface states, the spec- 
trum of which consists of an odd number of Dirac cones. 
As long as TR symmetry is preserved and the bulk gap 
remains open, at least one Dirac cone will survive the ad- 
dition of any perturbation. Moreover, the wavefunctions 
of this Dirac cone can not be localized by disorder, and 
the surface of the 3D TI is apparently a perfect metal 
in the absence of electron-electron interactiorPSHUl, Be- 
cause of the robustness of its surface states, this phase 
was called "strong TI" (STP/. 

On the other hand, if fo = but 1/ 7^ 0, the sys- 
tem is in a phase known as a "weak TI" (WTI). This 
phase is adiabatically connected to stacked layers of 2D 
TF^. Suppose we have a cubic sample. The two sur- 



faces which are aligned with the top and bottom layers 
will in general be gapped. But, the four perpendicular 
surfaces have gapless states, at least in a clean system. 
In the limit of completely decoupled layers, these surface 
states are actually the edge states of the stacked 2D TI. 
Translation-invariant coupling between the layers gaps 
out most of these surface states. However, Kramer's the- 
orem ensures two Dirac cones to remain, both centered 
at momenta that are TR invariant. In the following, we 
refer to this type of surfaces, unless otherwise stated. 

The chief reason why the WTI is considered weak is 
that its surface modes may be gapped without breaking 
TR symmetry or closing the bulk gap. In the stacked- 
layers picture, a mass term that gaps the edge modes 
arises if one couples the layers in pairs. The only symme- 
try violated by this term is the lattice-translation symme- 
try. Therefore, it appears that this symmetry is essential 
for the topological protection of the WTI surfaces. Since 
disorder breaks translational symmetry, one may be led 
to assume that the WTI surfaces are no longer protected 
and behave like conventional 2D metals with strong spin- 
orbit couplings. Such metals are known to undergo an 
Anderson transition from metals to insulators as a func- 
tion of disorder strengtlP^. 

In this paper, we show that the contrary is true. We 
consider the effect of disorder on the weak TI, and show 
that it is actually not weak at all. In Sec. |Tlj we show 
that the conductivity of the non-trivial surfaces of the 
WTI remains higher than e^//i in the presence of disorder 
of arbitrary strength, as long as the bulk gap and TR 
symmetry are maintained. 

Section [TTT| includes perturbative analysis. In the limit 
of weak disorder, we evaluate the weak localization cor- 
rection, and find it to be anti-localizing. In the opposite 
limit, we consider strong disorder that is limited to sev- 
eral atomic layers at the surface of the insulator. We find 
that such disorder makes the surface insulating, but cre- 
ates a perfect metallic sheet just beneath the disordered 
surface. We also discuss the conductivity in the interme- 
diate disorder limit, and raise the possibility that a phase 
with a universal finite conductivity appears. 
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In light of these results, we discuss in Sec. IV the 
unique surface anisotropy of the WTI, which implies that 
the robustness of the conductivity of a surface strongly 
depends on its orientation. Based on this anisotropy we 
raise the possibility of surface engineering. 



II. FINITE CONDUCTIVITY 

Assume one stacks an even number of 2D TPs, and 
couples them in pairs. Each such pair is topologically 
trivial, and generically has an insulating edge. Thus, 
in the 3D limit of an infinite even number of layers the 
surfaces are generically insulating. On the other hand, 
if the number of layers is odd, there is no way to gap 
all the edge modes without breaking TR symmetry, and 
the surface must be conducting. This sensitivity to the 
parity of the layer number was then argued to imply the 
fragility of the WTM 

While the argument for non-triviality in the odd case 
relies on topology, the argument for gapping of the sur- 
face modes in the even case relies on a well-tailored 
perturbation that couples the layers in pairs. Random 
disorder does not induce such a coherent perturbation. 
Rather, when disorder is present and the number of lay- 
ers is even, the surfaces may be trivial, yet do not have 
to be. On the other hand, for an odd number of layers, 
the surfaces must conduct. This suggests that when the 
coupling between layers is disordered, the odd behavior 
is in fact the generic one, thus the surfaces will conduct 
for any large number of layers. 

This heuristic argument will now be put on firm theo- 
retical ground. Consider a WTI of dimension which 
is adiabatically connected to an odd number of 2D lay- 
ers stacked along the z direction and L ^ 1. Note that 
we take the lattice spacing to be 1 . We take the pe- 
riodic boundary conditions to be periodic in the z and 
X directions, and open in the y direction, as illustrated 
in Fig. [ija). Under these boundary conditions, the sur- 
face states reside on the interior and exterior surfaces 
of a thickened torus. We allow for any disorder which 
is TR symmetric, does not close the bulk gap, and has 
a correlation length much smaller than L. Under these 
conditions, the surfaces have no special regions or lines 
to which the electrons wave functions could be restricted. 

Consider an Aharonov-Bohm flux that implements a 
phase twist in the periodic boundary conditions along 
the X direction, as illustrated in Fig. [iJa). Let us 
study how the spectrum of the edge modes depend on 
(j). For (/) = 0,7r, the Hamiltonian is TR symmetric, and 
Kramer's theorem guarantees that all the energies are 
doubly degenerate. Apart from these degeneracies, the 
spectrum has no accidental degeneracies, as implied by 
the non-crossing theorercfi^. This ensures a well-defined 
labeling of energies as a function of 0, i^i (</>), where 
i ~ 1,2, . . . and Ei+i > Ei. 

The difference between topologically trivial and non- 
trivial surfaces is manifested in the relation between the 
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FIG. 1: Topologically trivial and non-trivial pair switching, 
(a) The WTI is adiabatically connected to stacked layers of 
2D TI. We consider a WTI of dimension with = (0 1) 
and z as the stacking direction. The boundary conditions 
(green dotted lines) are periodic in the z direction and are 
twisted by an Aharonov-Bohm flux in the x direction (green 
thick arrow). The remaining xz surfaces are metallic, (b), (c) 
Typical patterns of energies of surface state as a function of 
the Aharonov-Bohm flux, (j), for (b) trivial and (c) non-trivial 
surfaces. Kramer's theorem assures that at the time-reversal 
invariant fluxes = 0, tt, states come in degenerate pairs. On 
a trivial surface, the pairs remain the same between these two 
values, while on a non-trivial surface the pair switch partners. 
The mean level spacing of the surfaces states is A/L'^, where 
A is the bulk gap. We show that non-trivial pair switching 
implies the existence of at least 0{L) extended states. 



pairs of degenerate states at = and at TiEIillil!^ which 
is illustrated in Fig. [ijb) and[l]^c). If the pairs at = 
are the same as those at = tt, the surface is topologi- 
cally trivial. This is the case for a trivial insulator and 
for a WTI with an even number of 2D layers. In con- 
trast, if the pairs switch partners between = and 
TT, the surface is non-trivial. Such pair switching takes 
place on the surfaces of an STI and of a WTI with an odd 
number of 2D layers. The zigzag shape of the spectrum 
in the non-trivial surfaces cannot be terminated without 
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approaching the bulk states. Hence, in this case 

J2\E,{7T)-Em>\ (1) 

i 

where the summation is over aU surface states and A is 
the bulk gap. Note that for any finite L, the number of 
surface states is proportional to L^, and the mean level 
spacing is A/i^. 

It is impossible to satisfy inequality ([T]) if all the surface 
states are exponentially localized. The current of a lo- 
calized state is exponentially small with the system size. 
The current carried by an electron in the i^^ eigenstate 
is given by /z(</f') = {e/h)d^E^. Therefore, d^Ei ~ e~^, 
and consequently, |£'i(7r) — £'i(0)| ^ e"^. In that case 
inequality ([T]) cannot be satisfied. 

Furthermore, li = e{v)i/L, where {v)i is the expecta- 
tion value of the velocity. Since, the velocity is bounded 
by intrinsic variables and cannot increase with the system 
size, li approaches zero at least as 1/L. Note that there 
is a value (f>Q such that \Ei{TT) — Ei{0)\ < {Trh/e)\Ii{(j)o)\- 
Therefore, for the inequality ([I]) to be satisfied, there 
must be at least 0{L) delocalized states. Furthermore, 
as long as the system is homogeneous, which is the case 
for random disorder, these states are distributed all over 
the surface. 

Imagine now cutting the system into two subsystems, 
one with even and one with odd number of layers. Since 
the cut is a surface effect and the system is homogeneous, 
it will not localize a state that has been delocalized be- 
fore. Thus, in the presence of random disorder, delocal- 
ized states will exist also in the subsystem with the even 
number of layers. We can therefore conclude that in a 
system with an even number of layers there are delocal- 
ized states in the presence of random disorder, despite 
the absence of topological protection. 

The homogeneity of the disordered system leads to a 
further consequence. Suppose we have cubic slabs of di- 
mension . According to what we have seen before, on 
the surface of the small cubes there are at least 0{l) de- 
localized states on the scale of I. Now we glue the cubes 
to one another and obtain a larger cube of dimension 
L^. Since the gluing process does not localize states, on 
the surface of the large cube there are at least 0{L^ /P) 
delocalized states on the scale of I. This scaling is con- 
sistent both with delocalization of all states, and with a 
scenario of localized states with a broad distribution of 
localization lengths. 

Finally, we notice that for the current li to decay as 
l/L, the electronic motion must be ballistic. If, however, 
the motion is diffusive, then the current decays as 
In ballistic motion, inequality ([ij required 0{L) delocal- 
ized states. In contrast, a diffusive motion requires 0{L?) 
such states. Since ballistic motion is unlikely in the pres- 
ence of disorder, the bound of 0{L) states is probably 
too restrictive. 

Having showed the existence of delocalized states, we 
can turn to estimate a lower bound for the conductivity. 



We use the Thouless formula, which relates the electri- 
cal conductivity to the sensitivity of energies to phase 
twistPHUl, 



where (AE/Acj)) denotes geometric mean of the energy 
difference Eiiir) — Ei{0) averaged over eigenstates and 
dN/dE denotes the density of states, both at Fermi en- 
ergy. This relation has been shown to be only qualita- 
tively correctP^I. For example, in ID systems the conduc- 
tivity scales hke [{AE / A(j)){dNjdE)f'^^ , and constants 
of order unity may appeaJ^^ESl. Moreover, discrepan- 
cies of 0(1) may appear if the relation is expressed with 
d'^E/{d4>y, rather than with AE/Ac^^. Nevertheless, 
when {AE/ A(f)) (dN/dE) is of the order of unity, the con- 
ductivity is expected to be of the order of e'^/h. 

In the non-trivial pair switching, see Fig. Iljc), the 
zigzag shape of the spectrum relates {AE/A^ to the 
energy levels spacing Ei+i — Ei. And since the level 
spacing is the inverse density of states, it leads to 
{AE/ Act)) (dN/dE) > 1. Consequently, 

We have therefore arrived at our key result: a non-trivial 
surface of a WTI will remain conducting even in the pres- 
ence of random disorder. 

Preliminary numerical work, reported in Appendix [A} 
indeed shows that as the number of stacked layers in- 
creases, the even-odd difference diminishes, and both 
tend to lack of localization. 



III. PERTURBATIVE ANALYSIS 

The topological argument allowed us only to bound 
the conductivity. More quantitative predictions can be 
given in the limits of weak disorder and strong surface 
disorder, where perturbative approaches can be utilized. 
Disorder is defined to be weak when EpT ^ 1, where Ep 
is the Fermi energy and t is the mean free time. In this 
limit we evaluate the lowest-order quantum correction to 
the conductivit}^^. 

The low energy effective Hamiltonian describing the 
surface of a WTI in the clean limit consists of two decou- 
pled Dirac cones 

H(kx,ky) ^ va(kxr ® Sj. + kyl ® Sy). (4) 

For every value of k^ and ky this is a 4x 4 matrix, spanned 
by a direct product of two Pauli spinors: r that denotes 
the Dirac cone, and s that denotes the electron spin. Here 
Wo is the velocity characterizing the Dirac cones, and I* 
may be either the unity matrix / or the Pauli matrix 
Tz, depending on the particular WTI considered. The 
corresponding TR operator is Tw = I ® iSyK, where K 
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denotes complex conjugation. Accordingly, = — 1. 
Notably, since under TR each Dirac cones is mapped to 
itself, in general their chiralities are unrelated, as well as 
the energy of the Dirac points. 

Disorder adds to the Hamiltonian a sum of the form 
n '^mn{'^){Tm ® whcrc the indiccs m, n take the 
values 0, x, y, z and tq — st) — I. For the WTI only 
six terms are TR symmetric: I(E)I,Tz(E)I,Tx(S>I and 
Ty (E) s. The first three describe potential disorder, and 
the last three describe random spin-orbit scattering (note 
that the clean Hamiltonian already includes spin-orbit). 
Among the six, only the term Ty <Si Sz gaps the spectrum. 

In Appendix [B] we evaluate the lowest order quan- 
tum correction to the conductivity from the low energy 
Hamiltonian Q , in the presence of all mentioned types of 
disordeil^S. We find this correction of be anti-localizing, 

dlnL - 27raJ^ 2 j 

where axx = o'xx{h/e'^) — Eprfy. Furthermore, 2/3 < 
/„ < 2 is the vertex correction, and — 1 < /e < 1 is 
a correction of the Cooperon, both are determined by 
the details of the disorder. Equation ([5| implies that 
the conductivity flows towards a perfect metal, and a^x 
increases logarithmically with the system size. 

The Hamiltonian Q appears also in two other 2D sys- 
tems, and it is instructive to elucidate the similarities and 
differences between these systems and the WTI. The first 
system is that of spinless electrons in graphene. For that 
system I* = r^, and s denotes the sublattice index. Ac- 
cordingly, its TR operator is Tq = Tx® IK and Tq — 1. 
By plugging Tq instead of Ty/ into Eq. (Isl), w e observe in 
graphene weak localization, as expectecP^ESl. As a matter 
of fact, since for the WTI Ty^ = —1, for generic disorder 
the Hamiltonian belongs to the symplectic class, which 
is known to have weak anti-localization correctiorfSl. In 
contrast, spinless graphene, for which Tq = 1, belongs 
to the orthogonal class, which shows weak localization. 
The general relation between the symmetry class and the 
sign of the weak-localization correction can be shown us- 
ing the non-linear a model approacH-SI^ but may be also 
understood more directly by means of interference of dif- 
fusive trajectories, as shown in Appendix [Cj 

The second system is that of a 2D insulator at the tran- 
sition point between a trivial and a topological phase in 
the absence of inversion symmetrj^^. This system be- 
longs to the same symmetry class as the WTI, but its 
spectrum does not exhibit pair switching as a function 
of flux. Nonetheless, since it is tuned to a phase tran- 
sition, one expects the correlation length to diverge and 
therefore delocalized states must exist at low energies. 
In Refs.'22Hlll it was established that a band of delocal- 
ized states appears around zero energy, while far from 
zero energy states are localized. This should be com- 
pared with the WTI, where, as we have argued, states 
remain delocalized for any sub-gap energy. This discrep- 
ancy suggests that while these models have similar low 




FIG. 2: Renormalization group flow of the conductivity. The 
/^-function of the dimensionless conductivity a^x for a non- 
trivial surface of the WTI, compared with that of a 2D metal 
with strong spin-orbit coupling that belongs to the symplectic 
class (blue). According to Eq. (|5| in the limit of high con- 
ductivity the flow is toward a perfect metal. We showed that 
the conductivity of a WTI surface cannot drop below e^/h. 
Consequently, two types of flows are possible: (I) always flow- 
ing towards a perfect metal (yellow), and (II) flowing with a 
stable fixed point of finite conductivity (red). 



energy descriptions, they are nonetheless different when 
the entire spectrum is taken into account. 

Notably, following the posting of this manuscript on 
the arXiv, our prediction for delocalization within the 
low energy theory, Eq. Q, was validated numericallj^^. 
The restriction of this numerical work to low energy does 
not address directly the role of pair switching. Indeed, 
one can find a unitary transformation that maps the low 
energy Hamiltonian used there, including the disorder 
terms, to the low energy part of the Hamiltonian used in 
Ref.'^^ to describe the transition mentioned in the previ- 
ous paragraph, which exhibits no pair-switching. 

Having shown that in the limit of weak disorder we 
have a perfect metallic surface, we now turn to oppo- 
site the limit of extremely strong disorder. In this limit 
the strength of the disorder is much larger than all the 
other energy scales, including the bulk band width and 
band gap. If such disorder acts on the entire 3D system, 
it mixes the bulk bands and makes the entire sample a 
trivial insulator. However, an interesting case is disor- 
der that is limited to several of the outmost layers. This 
may actually happen in realistic surfaces, which are usu- 
ally made dirty by oxides and other dopants. Moreover 
as we show below, it also reveals the role of the bulk in 
protecting the surface states. 

Let us divide the Hamiltonian of the three dimensional 
system, H^d , into the part that operates only within the 
clean bulk (Hq), the part that operates on the disordered 
surface layers (iJdis), and the part of hopping between the 
two (V). The Hamiltonian may now be written as 
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We begin with the case where all the eigenvalues of iJdis 
are greater in absolute value than some value W, and 
W ^ t, where t is the bulk band width. For this case, all 
the eigenstates of Hdis are localized on the surface. These 
eigenstates may be considered as an high energy sector, 
which can be integrated out. To this end, we consider 
the Green's function projected onto the Hilbert space of 
the clean bulk'^®, using the projection operator Pq 

PoiE-HsDr^Po= (7) 

{E-Ho- y(ffdis - E)-W^ + 0{V^)y' . 

This Green's function defines an effective Hamiltonian 
for the clean bulk, which is 

ffeff(S) =Ho + y(i/di. - E)-^^ + 0{V^). (8) 

This effective Hamiltonian describes the degrees of free- 
dom of a 3D WTI which is clean in the bulk (the first 
term) , and is disordered at its surface (the second term) . 
Note that this surface lies beneath the physical surface, 
where the disorder vanishes. The second term of Eq. Q 
represents virtual hopping from the bulk to the strongly 
localized states at the physical surface and back. Since all 
the eigenvalues of Hq, the matrix elements of V and the 
energy E are of the order of t, this bulk-surface coupling 
is of the order jW ^ t. The effective Hamiltonian then 
describes a weakly disordered WTI, the gapless states of 
which are located underneath the physical surface. Re- 
calling the above result, we can see that the relocated 
surface states form a perfect metal. 

The same holds when the spectrum of H^^s becomes 
continuous. The states in the strongly disordered layers 
with energy greater than W can still be integrated out, 
resulting in small 0{t^ /W) terms. The remaining low 
lying states are expected to be localized, since i?dis alone 
is not protected from localization. Such states act as 
strong scatterers. However, their density is of 0{t/W), 
and is therefore small, yielding a long mean-free path. 
Hence, we are still in the limit of weak disorder, thus 
having a perfect metal. 

Intermediate disorder is disorder with EpT <^ 1 but 
At ^ 1. According to the topological argument, the 
conductivity has to be larger than e'^/h even in this 
regime. Following the single parameter scaling approach, 
two possible flows of the renormalization group may 
arise, which can be presented in terms of the /3-function, 
Pi^xx) = d\naxx/dlnL. In one flow, the conductivity 
always flows to infinity while increasing the system size, 
as presumably happens in the STI^" In the second 
flow, a stable fixed point appears at a^r,. « 1, and a crit- 
ical point appears for some dxx > 1- The two flows are 
illustrated in Fig. [2pZl 

IV. SURFACE ANISOTROPY 

In the previous sections we analyzed the conduction 
properties of surfaces of the WTI, and found that they 



are conducting even in the presence of disorder. This ro- 
bustness brings the WTI closer to the STI in terms of 
their transport properties. Nevertheless, the WTI differs 
from the STI in the unique anisotropic behavior of its 
surfaces, which gives rise to the idea of surface engineer- 
ing. 

While non-trivial surfaces of the WTI are indeed ro- 
bustly conducting, not all possible surfaces of the WTI 
are topologically non-trivial. For example, we mention 
that in the stacked-layers picture the top and bottom sur- 
faces are topologically trivial, and are generally gapped. 
For given weak indices u and a plane with Miller indices 
h, we define the relation ft, ~ i/ by 

(h, - Vi) mod 2 = 0, (9) 

for i = 1,2,3. Any surface with Miller indices that sat- 
isfy this relation is topologically trivial, whereas a sur- 
face with h u is topologically non- trivial'^; The rea- 
son of this criterion is that the indices vector v does 
not uniquely define a stacking direction, and any vec- 
tor h ^ u can be a stacking direction, as illustrated in 
Fig. [3](a). Namely, the stacked-layers picture is a the- 
oretical construction rather than a physical description, 
and in practice, the WTI does not have to be layered. 

An alternative explanation for criterion ([9]) can be 
given from the picture of a fixed stacking direction and 
varying surfaces. Consider a WTI, the primitive lattice 
vectors of which are 01,02,03, making the lattice sites 
located at r„ = X]?=i ^iO'i- Consider also a surface with 
Miller indices h, and for simplicity place the origin of the 
coordinate system at some lattice site of the surface. By 
definition, all the lattice sites on the surface satisfy the 
condition h ■ n = 0. For simplicity, we take the example 
oi v> = (0 01) and choose it to be the stacking direction. 
li h ^ V, then /13 is odd, while hi and /i2 are even. Ac- 
cordingly, on the surface all the na coordinates are even, 
and adjacent surface sites differ by an even increment of 
713. Therefore, the surface is composed of steps of an even 
number of layers, as illustrated in Fig. [3|^a), and the cou- 
pling between them will gap the edge states. For h v, 
the surface is composed of steps of odd layers. Now, the 
coupling can not gap all the edge states, and the surface 
will conduct. 

The high and non-trivial sensitivity of the surfaces 
to their orientation even in the presence of disorder is 
demonstrated in Fig. [Sjb). We considered a 20 x 20 x 20 
lattice of the Fu, Kane, and Mele model with Xso = t and 
St = [—0.6, 0, 0.2, 0]t, which corresponds to z^o = and 
u — (001) with a bulk gap of A = 0.8t. In this model 
v represents also the weak hopping direction. Uniformly 
distributed strong disorder of magnitude t was also in- 
troduced. The figure depicts the local density of surface 
states integrated over an energy window \E\ < 0.1 A. 
The parallelepiped is cut along the primitive vectors, 
and therefore has 2 trivial gapped faces and 4 topological 
metallic faces. 

The criterion for a surface h u implies that the 
spectrum on it will be gapped, but it does not provide 
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(a) hi = (0 1) v = (001) 




FIG. 3: Insulating and metallic surfaces, (a) The WTI has 
both trivial and non-trivial surfaces. A surface with Miller 
indices h is trivial if {h — v) mod 2 = 0, denoted hy h ~ v, 
since any such h can denote a stacking direction. The figure 
depicts two trivial surfaces hi = (0 01) and h2 — (2 01) for a 
cubic crystal with assumed i> = (0 01). Both hi and /i2 are 
legitimate stacking directions. Alternatively, for a stacking 
along hi, the /12 surface is composed of steps of two layers. 
The coupling between the layers gaps theirs edge states. For 
h 00 u, the steps will be of odd number of layers and will 
therefore conduct, (b) An example of the surface anisotropy 
in the Fu, Kane and Mele model of the weak 1/ = (0 1) phase. 
Depicted is the local density of surface states integrated over 
an energy window \E\ < 0.1 A, with disorder strength compa- 
rable to the bulk gap. The surfaces of the parallelepiped are 
spanned by the primitive vectors. The two faces with Miller 
indices equal to 1/ are gapped, while the other four, with or- 
thogonal Miller indices, are metallic. By controlling the cleav- 
age process, the conductance of each face of the WTI can be 
engineered. 



information on the magnitude of the gap. In the above 
example, for h chosen to be in the weakest hopping di- 
rection, the energy gap on the surface is comparable to 
the bulk gap. Other trivial surfaces have energy gaps 
much smaller than this value. The influence of disorder 
on the gap and localization length of such surfaces may 
be dramatic. We note that for a surface that cannot be 



described by Miller indices, we expect metallic behav- 
ior, since the scaling argument which was used to ensure 
f^xx > e^//i seems to hold. 

By noticing that the topological and trivial surfaces 
are isotropically distributed, one can imagine creating a 
sample with each face engineered to be either gapped or 
metallic. A gapped surface along a stacking direction 
would remain insulating, while other surfaces will con- 
duct. Provided rather good control on the cleaving pro- 
cess, various different electronic behaviors are expected 
on different surfaces, ranging all the way from perfect 
metals to insulators with varying gaps. In light of these 
results, we find that the anisotropic behavior of the WTI 
surfaces is not a sign of their weakness, but rather of their 
richness. 



V. SUMMARY 

In this work, we showed that the name "weak topo- 
logical insulators" does not do justice to the phase it de- 
scribes, since the electrical conductivity of the non-trivial 
surfaces of such insulators is not suppressed by disorder. 
The WTI shows unique sensitivity of the electronic prop- 
erties of its surfaces to their orientation, and that may 
provide an experimental tool for controlling these prop- 
erties. We hope that this work will serve as a trigger for 
further study of these interesting topological phases. 
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Appendix A: Numerical analysis of disordered thin 
WTI 

In an attempt to address numerically the effect of dis- 
order on the conductivity of a gapless surface of the 
WTI, we considered the Fu, Kane and Mele modeP 
with A50 — t and 6t = [— 0.6, 0, 0.2, 0]i, which corre- 
sponds to a 1^0 = and v — (0 01) with a bulk gap of 
A — 0.8t. We also took the chemical potential to be at 
the Dirac points of the surface spectrum. The most gen- 
eral potential disorder that is symmetric to time-reversal 
was included by adding a time reversal symmetric ran- 
dom matrix which acts within unit cells. The entries 
of each matrix were sampled from a uniform distribu- 
tion in some region [—w/2,w/2\, and the resulting ma- 
trix was then symmetrized with respect to time rever- 
sal. The disorder was added on three outmost layers 
with w = 0.5t, 0.5e~^t, 0.56""^^ corresponding to the first, 
second and third layer respectively. The samples sizes 
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Lx'xLyX Lz ranged in from 40 x 10 x 1 up to 120 x 10 x 6 
unit cells, where Lz can be thought as the number of the 
stacked 2D layers. 

In order to obtain the conductance gxx we used Eq. (2) 
of the main text. When applied to a quasi-lD sam- 
ple this equation yields the conductance rather than the 
conductivity'^. The fluctuations in energy levels follow- 
ing the insertion of a 7r-twist were approximated by ex- 
trapolating the derivative of the energy levels with re- 
spect to the phase twist. The geometric averaging was 
taken over the different instances of disorder and over 
an energy window of [— 0.2i, 0.2i]. Although this second 
averaging is not included in the definition, we find that 
it did not have significant influence on the asymptotic 
behavior. We considered 30 instances of disorder for 1-3 
layers, and 10 instance of disorder for 4-6 layers. The er- 
ror bars are primarily due to fluctuations of the density 
of states which limit the accuracy of the estimated mean 
value. 
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We note that a similar behavior was obtained in 
for symplectic multichannel ID wires. This model is close 
to ours, but with one important difference. In multichan- 
nel ID wires all the channels are coupled, while in the 2D 
surface of WTI only nearby channels are coupled. 



Appendix B: The first order quantum corrections to 
the conductivity 

In this appendix we derive the lowest order quantum 
corrections to the electrical conductivity of WTI and 
spinless graphene. While the former is our main inter- 
est, we find it instructive to compare it to the latter. 
Our starting point is a low energy effective Hamiltonian 
for both systems. The Hamiltonian is composed of two 
decoupled Dirac cones 



= -ivoidxl* 'Si Sx + dyl 'El Sy), 



(Bl) 



where Si are Pauli matrices associated with the spin (sub- 
lattice) index of WTI (graphene) , and the matrix /* is a 
Pauli matrix associated with the valley index (cf. Eq. (4) 
in the main text). For WTI, /* denotes either I2x2 or Tz, 
while for graphene /* = r^. The corresponding retarded 
and advanced Green's functions are given by 



^4 



> Sx + VokyI (g) Sy 



(B2) 



where E± — lim,j_>.o+ E ± irj. Time reversal invariant 
potential disorder is introduced via the matrix V(x) 



H = Ho + V{x), 
V{x) ^^vi{x)Ai, 



(B3) 
(B4) 



FIG. 4: The conductance g^^:, for a WTI with = (0 1) 
of the Fu, Kane and Mele model as estimated from the flux 
sensitivity of surface state energies multiplied by the density 
of states. Each line corresponds to a given number of layers 
(1/2 ) and shows Qxx as a function of L^, where Ly is fixed 
to 10. Periodic boundary conditions were imposed on z and 
X. Samples with an odd number of layers have a topologi- 
cally protected minimal conductance of /h. Samples with 
an even number of layers are (strictly speaking) topologically 
trivial and show a localization behavior. However their con- 
ductance converges to that of the odd layers as the number 
of channels is increased. 



The dependence of gxx on the dimensions of the surface 
is depicted in Fig. [4] For odd Lz the conductance tends to 
values close to the e^//i, and shows no sign of localization. 
For comparison, the localization length of a sample with 
disorder of a similar strength that does not satisfy time 
reversal symmetry is around 40 unit cells. For even Lz 
a finite localization length is apparent, which however 
increases with Lz- It therefore appears that for large Lz 
the even curves will converge to the odd curves, meaning 
a lack of localization for ^ 1. 



where Ai are 4x4 time-reversal symmetric Hermitian 
matrices of the form Ti ® Sj for i, j = 0, x, y, z. The vi(x) 
are uncorrelated random functions 



{vi{x)vi,{x')) = wi6ii,6{x - x'). 



(B5) 



As mentioned in the main part of the paper, the time- 
reversal operator T is different for spinless graphene and 
WTI. For spinless graphene, the time-reversal operator 
switches between the two Dirac points, but does not af- 
fect the sublattice. Therefore, Tg = Tx^IK, where K de- 
notes complex conjugation. On the other hand, for WTI 
it flips the spins but does not affect the valleys, since 
the Dirac points are at time-reversal-invariant momenta. 
Therefore, Tw = I^SyK. Consequently, Ty^^ = — 1 
while Tg^ — 1. As argued in the main work using the 
particle diffusion picture, the signs of the quantum in- 
terference correction to the conductivity is expected to 
be given by —T^. Another consequence of the differ- 
ence in T is that the Ai matrices which commute with 
Tg are all the combinations of (/, Tx, Ty) ® (J, Sx, Sz) and 

/, Ty ® Sxi'Ty ® ^yi'^y 



I®I,Tx 



Sy, while the matrices which commute with T^y are 



•I,Tz 
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Due to extra symmetries of Hq, there are additional 
anti-unitary operators which commute with Hq. For ex- 
ample, for Hq in which /* = /, all the r^T^ matrices are 
such operators. If one chooses disorder that commutes 
with TiTw, rather than with Tw, then the sign of the 
quantum correction will be —{TiTw)'^- 

Our goal is to find the changes in the disorder-averaged 
conductance as a function of the linear size of the system. 
The zero-temperature mean longitudinal conductance in 
the X direction is given bjPHl 



bulk gap (in WTI) or the bandwidth (in graphene). In 
the following we ignore the real part of the self-energy, 
since it only corresponds to a shift in the energy. The 
disorder averaged Green's function is now given by 

^R/Af, E±iT + ygk^r (g> s.^ + vpkyl (g> Sy 



e 

2TTh 



where (..) denotes averaging over disorder, and Jx = 
VqI* ®Sx is the current operator. The diagrammatic way 
to find this mean value combines the disorder averaged 
Green's function, the vertex correction, the Cooperon 
and the dressed Hikami box. 

Our derivation follows McCann et al. in Ref."^*^- for 
spinless graphene, but with three substantial differences. 
First, we address here both the WTI and graphene si- 
multaneously in a way that emphasizes the differences 
between them, both in the Hamiltonian and in the result- 
ing correction. Second, the only assumption we make on 
the disorder is that it is symmetric with respect to time 
reversal. We do not assume a dominance of one type 
of scattering over another. Consequently, the numerical 
prefactor of the fi function depends on the details of the 
disorder, and these details may affect it by a factor of 
up to 1/3. Last, since the spectrum of WTI far from 
the Dirac point is not universal, we adopt a different reg- 
ularization approach for diverging integrals. Instead of 
introducing a triangular wrapping, we limit the minimal 
length scale of the scatterers. For alternative approaches 
for dealing with this issue see Refs.l^^Ml 

We begin with calculating the self-energy within the 
self consistent Born approximation, given by 



(B6) 



Since Aj^ — I ^ I, and the angular integration over p 
leaves only the diagonal term in 



[sf]„-(g,£;) = 4 



2r 

TT E 



where i,j — 1..4. In the limit of weak disorder 



(B7) 



(B8) 



The level width T is related to the mean free time r 



through T = l/2r. In order to obtain Eq. (B8| we have 
introduced an ultra violet cutoff A, which physically cor- 
responds to the characteristic inverse size of the impuri- 
ties, and we assumed that vqA is much smaller than the 



/ 
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FIG. 5: The leading self-energy diagrams in the Born ap- 
proximation: (a) first order, and (b) , (c) second order. 



The self-consistent Born approximation includes dia- 
grams where disorder lines do not intersect, such as de- 
picted in Figs. [5](a) and [sjb). It leaves out diagrams 
where disorder lines intersect, such as Fig. [Sfc). We find 
the self-consistent Born approximation to be valid when 



E 



< 1 and 



^1 ^o^^^i 



(B9) 



Note that due to the logarithmic factor, even when vqA 
becomes much larger than E there is still a wide parame- 
ter range in which the conditions are satisfied. Moreover, 
within this range the omission of the real part of the self 
energy is consistent. 

In the limit of weak disorder diagram of Fig. [5jb) is 
included in our approximation, but its contribution is 
smaller by a factor of a relative to that of Fig. [sjja). Its 
contribution is 



LI' •' 



(BIO) 



AiG^{p^,E)AvG^{p^ - p2.E)AiG^'{p^,E)Av. 

Due to the nested structure of the diagram, the integra- 
tion over the two loops can be carried separately. The 
contribution of the diagram is therefore 



{T/EFf\n{voK/EF) 



(Bll) 



Indeed this contribution is negligible for a ^ 1. The 
crossed diagram, which is depicted in Fig. [sj^c) , can also 
be shown to be of 0[c?). 

The self-consistent equation of the vertex correction is 
schematically illustrated in Fig. l6la). If we denote the 



corrected vertex by Jx, then 



Jx[q,E) =voI* ®s 



(B12) 



E 



Wl 



AiG^(p, E)Mq, E)G''{p + q, E)Ai 



For q = we guess a solution of the form Jx{0,E) — 
fvol* (g) Sx, which gives 

fr(SSx=I*^Sx + fJ2 wiMl* ® Sx)Ai (B13) 

(27r)2 [{E + ir)2 - Vq^P^][{E - ir)2 - 1,^2^2] ■ 

Due to x-y symmetry the terms with momenta in the nu- 
merator vanish. Moreover, Ai{I* s^) = (8 Sx)Ai, 
where ^/ = ±1. Therefore J^i^i^ii^* Sx)Ai = 
(Si 'S^^x), and the matrix structure of the equa- 

tion is satisfied. After integrating wc find that 

(B14) 

where we used the fact that T E. We can therefore 
conclude that the vertex correction is 



</<2. 



(B15) 



(2^)2 



The next task is to solve the self-consistent equation 
of the Cooperon, which is depicted in Fig. [6]^b), 

Cfe)(„™)(fc,fe";^,Q,w) = (B16) 

3'){i" J")i^ ; E, Q, Uj)V(^i"j")(nm) 

(27r)2 ^(*j)(»'j')n(i'j')(«"i")(^'; ^) 
X C^,„j.)(^rim){k',k";E,Q,uj), 

/ 

n(,j)(„„,)(fc';i;,Q,w) = (B18) 
[G«(fc' + Q,E + uj)U[G^{-k\E)]j„^- 

This equation can be considered as a matrix equation 
for C{E, Q, uj), which acts on the vector space |fc) ^ 
where k denotes the momenta, and ij denote the internal 
degrees of freedom of the two particles (of dimension 16). 

Anticipating an infrared divergence which is propor- 
tional to a diffusive propagator, the Cooperon may be 
presented as 



DQ2 _ 
- {regular terms). 



(B19) 







k"+Q 



FIG. 6: Diagrammatic representation of the self-consistent 
equation for (a) the dressed vertex and (b) the Cooperon. 



equation for the diverging term 

\d){d\ 



Plugging this ansatz into Eq. (B16I, we can extract an 

= VUV. (B20) 



c(l - VU) 



DQ^ - iLo 



Multiplying from the left with [VUVy^ and fr om the 
right with \d), gives an eigenstatc equation for the diffu- 
sive mode 

(UV)-^ {V-'^ - n) Id) = c-^{DQ^ - iuj)\d), (B21) 
where 

{V-^ - n) = wiAi ® A^j (B22) 
d^k 1 

(27r)2 [{E + UJ + ir)2 - v^^k^] [{E ~ ir)2 - vo^k^] 

X {E + LO + iT + Vokxl* ® Sa; + VokyI (8) Sy) 

® (i? — iF — vokxl* Sx — VokyI (g) Sy). 

The terms which are linear in kx and ky vanish in the in- 
tegration, and the remaining three integrals, which mul- 
tiply the three matrices (/(g)/)(g)(J(g)/), {r(8)Sx)(8){I*(8)Sx) 
and (/ €5 Sy) €5 (/ (8) Sy), are respectively 



(B23) 



/ 



d^k {E 


+ Lu + iT){E 


-^T) 


(2^)2 [{E + trp- 


-vo'kmE- 


iF)2 - VQ^k^] 








2 






d^k 






(2^)2 [(^ + lF)2- 




iF)2 -2, fc2] 


4 


0{T/E,a), 




d^k 






(2^)2 [(^ + jF)2- 


-Vo'kmE^ 


iF)2 - l,(j2p] 


4 


0{T/E,a). 





(B24) 



(B25) 
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Therefore 



(B26) 

X ^ U - ^ [(/* «) s^) ® (/* «) s^) + (/ (g) Sj,) ® (/ ® Sj,)] 



We are interested in the zero mode of the above matrix. 
Since V mixes aU momenta equahy, any eigenvector of 
V^^ which depends on momenta will have a diverging 
eigenvalue, and cannot give rise to a zero mode in the 
above equation. For generic disorder which respects time- 
reversal symmetry we find that the zero mode for WTI 
(with I* = I for concreteness) and graphene are given by 







FIG. 7: The leading order quantum corrections to the con- 
ductivity. These diagrams can be viewed as a combination of 
a dressed Hikami box with the Cooperon. 



{k,ij\dg) = Ss^^SjiST,,lSrj-i + Sr^^-lSrj^l)/2, 

where (si) is the valley (spin/pseudospin) subindex 
of the index i. This can be easily verified by the facts 
that Ai\d} = \d), (/* (g) s^) «) (/* g) s^)\d) = -\d), and 
(/ g) Sy) (g) (/ Sy)\d) = -\d). Note that the vector \d) 
has an eigenvalue of 1 with respect to HV, and therefore 
{IlV)-'\d) = \d). 

T he di ffusion coefficient D and the consta nt c from 
Eq. ( B19[ ) can be extracted by expanding Eq. (B21 ) in w 
and Q. After some algebra one finds that in both cases 



they are regular. Henceforth 



2 ? 

A « ^ ^ 



r2 



r 



d'^Q c 



(27r)2DQ2y (27r)2 



(B31) 



X [I* (g) s^]^,^[r g) S:,]jj:G^,. {k)Gj J-k) 
xGl{-k)G^^^{k){t,t,\d){z2,H\d). 



c = 8v, 



D 







2r 



E 



(B27) 
(B28) 



Keeping only the divergent part of the integral over Q, 
and noticing that the index summation is actually a 
trace, we have 



Note that we keep c although it is of lower order in T/E, 
since it is associated with the divergence of the Cooperon. 

The leading term of the conductivity (Txx is given by 
"2 r d^p 







2Trh J (27r)2 



TiiJ,G''{p,EF)J.G''ip,EF)] 



h ttvq^ 2 h 2 T' 



(B29) 



The first quantum interference correction Scr^^, which is 
depicted in Fig. [7][a), is given by 



^ 2nh J {2tt 



f [J.].'.[^.]«'G;.(fc,£;i.) (B30) 
X G^. (-fc + g, EF)Gf,^ i-k + Q, Ef) 



xG^^.(fe,i?^)C(,^.)(,^.)(fe,fc;i?^,Q,0). 

The divergent contribution to the correction comes 
from the limit of Q = in the Green's functions, where 



(B32) 



d^k 
(2^2 

X Tr [G^(-fe)(r (g s^)G^{-k)\d) 

X G^ikfir^s^fG^'ikfld)] 



where the matrix \d) is defined by = {ij\d). Using 

the fact that 



G^(-fc)(r g) s^)G"i-k) cx 2v^^k^ky{I (g Sy) 



+ (£;2 + r2 + «o2fc2 _ y^^kl){r ® S,) 

- 2Evok^{I «) /) - 2vokyr{r g) s^), 



(B33) 



the only non vanishing and non negligible traces are those 
which are proportional to Tr[(/* (g Sx)\d){I* g) Sa;)|d)], 
Tr[|d)(/ ® Sy)\d){I (g Sy)], and Tr[|d)|d)]. The resulting 
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correction is now 



(5tT° 



Tr[|d) 



|ln(L) 



An^h D 



fx 



(27r)2 [{E + ^^)2 - v^'^k^Y[[E - iVf - v^'^k'^f 



-Tr[M) 
(Pk 



\HL) 
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e Uq c 
2^D 



(27r)2 [(S + ir)2 



Trf 



|ln(L) 



AE^ 

w 

E 



AiT^hD-' i6r3 



1 



(B34) 



where we replaced [\d)Y with T^, since they are equal. 

As first noted by Ref:'^P, the extra quantum corrections 
to the conductivity, which are depicted in Figs. [7][b,c), 
are non vanishing due to the independence on momenta 
of the current vertex. These two contributions are equal, 
and are given by 



Sal 



f 



(B35) 



(B36) 



j:iWiTT[Ml(g>s,)Ai'^{I(g,s,)] WTI 
Y,i wiTr[Ai{Ty (g) Sx)Ai^{Ty (g) s^)] graphene 



Since —1 < /' < 1, the sign of the quantum correction is 
still determined entirely by T^. 

-1. 



We have shown above that T^^ 



1 while T 2 



Therefore we can conclude from Eqs. (B34|-(B36| that 



spinless graphene tends to be localized, while a WTI flows 
towards perfect conduction. 



Appendix C: Weak localization and the 
time-reversal operator 

In this appendix we provide a straightforward expla- 
nation for the fact that the sign of the weak localization 
correction is the same as the sign of the time-reversal 
operator squared (T^). To this end, we express the 
Green's function as a sum over amplitudes associated 
with trajectories. Similarly, we express the return prob- 
ability as a sum over products of such amplitudes. The 
coherent contributions that give rise to weak localiza- 
tion/antilocalization come from products of time-reversal 
conjugate trajectories. By analyzing the action of T on 
trajectories the above relation is established. 

Consider the Dyson series for the Green's function G, 



(CI) 



n=0 



where G° is the clean Green's function, and V is the dis- 
order potential. The matrix element of G that connects 
the lattice site i and spin state a with the lattice site 
j and spin state a' may be written as a sum over tra- 
jectories that connect these two sites and spin states, 
and which go through a series of intermediate points 



{ia, i„o-„, i„_icr„_ 



G ■ , — S^ A" 



711*71— lO" 71— 1 



(C2) 



Given that the system is symmetric to some anti- 
unitary operator, most notably the time-reversal oper- 
ator T, we define |(t) = S^aT\(T), where = ±1- Conse- 
quently, G! 



Co-^o-'G^g./ jg. and Via-Ja' — S,rT^a'Vja' 



A straightforward manipulation then yields 



(C4) 



where a = {ia'^iiai, ...,ia). Note that all the sign fac- 
tors except ^o- and appear twice, and therefore are 
canceled out. 



Using ( C2 1 we find that the probability of a particle 



to return back to its initial site, with perhaps a different 
spin state, is given by 



\G 



E 



A" 



' (-^i(T,icr') 



(C5) 



Two types of pairs of trajectories contribute coherently to 
the disorder-averaged double sum in equation (C5 ), since 



their phases do not fluctuate. The obvious contribution is 
the classical contribution consisting of pairs with a = a'. 
However, due to T— symmetry, an additional contribu- 
tion exists in which a comes paired with a. Comparing 
equation ( C4 1 with equation ( C5 ) one finds that pairs of 



time conjugated paths may appear only if a' — a. There- 
fore whenever it appears, the sign factor of such term is 
= {a\T'^\a) ~ sign(T2). Also notice that the size 
of this term is equal to the size of the classical term, 
and therefore may either double or suppress it. Hence 
for ^l(l) the probability for a diffusing particle to 

return to its original position is higher (lower) than the 
classical probability, and this is an indication for weak 
anti-localization (weak localization) . If the Hamiltonian 
commutes with more than one anti-unitary operator, for 
example in the case of a spin independent Hamiltonian, 
the total correction is composed of the contributions from 
all the different trajectories with a' = a. 
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